Review of Modern Logistic Regression Methods with Application to Small and Medium Sample Size Problems
نویسندگان
چکیده
Logistic regression is one of the most widely applied machine learning tools in binary classification problems. Traditionally, inference of logistic models has focused on stepwise regression procedures which determine the predictor variables to be included in the model. Techniques that modify the log-likelihood by adding a continuous penalty function of the parameters have recently been used when inferring logistic models with a large number of predictor variables. This paper compares and contrasts three popular penalized logistic regression methods: ridge regression, the Least Absolute Shrinkage and Selection Operator (LASSO) and the elastic net. The methods are compared in terms of prediction accuracy using simulated data as well as real data sets.
منابع مشابه
Penalized Lasso Methods in Health Data: application to trauma and influenza data of Kerman
Background: Two main issues that challenge model building are number of Events Per Variable and multicollinearity among exploratory variables. Our aim is to review statistical methods that tackle these issues with emphasize on penalized Lasso regression model. The present study aimed to explain problems of traditional regressions due to small sample size and m...
متن کاملSample size determination for logistic regression
The problem of sample size estimation is important in medical applications, especially in cases of expensive measurements of immune biomarkers. This paper describes the problem of logistic regression analysis with the sample size determination algorithms, namely the methods of univariate statistics, logistics regression, cross-validation and Bayesian inference. The authors, treating the regr...
متن کاملبکارگیری روش باز نمونه گیری بوت استرپ در رگرسیون لجستیک و کاربرد آن در تحلیل داده های مربوط به بیماران مبتلا به سرطان سینه
Background and Aim: The purpose of this study was to assess the accuracy of the bootstrap method in logistic regression and to explore the method's use in logistic regression models in cases where the sample size is insufficient. Materials and Methods: We use data from 150 patients who had undergone surgery at the Cancer Institute, Emam Khomeini hospital during from 1999 to 2001. Then we drew...
متن کاملFUZZY LOGISTIC REGRESSION: A NEW POSSIBILISTIC MODEL AND ITS APPLICATION IN CLINICAL VAGUE STATUS
Logistic regression models are frequently used in clinicalresearch and particularly for modeling disease status and patientsurvival. In practice, clinical studies have several limitationsFor instance, in the study of rare diseases or due ethical considerations, we can only have small sample sizes. In addition, the lack of suitable andadvanced measuring instruments lead to non-precise observatio...
متن کاملSubstitution of inorganic fertilizers with organic manure reduces nitrate accumulation and improves quality of purslane
Growers often apply high amounts of chemical fertilizers for vegetable production and this application contributes to concerns about nitrate levels in food. An experiment was conducted to investigate soil N amendment effects for reducing the nitrate accumulation and improving the quality of fresh purslane (Portulaca ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010